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DETAILED ACTION 
Response to Arguments 

1 . Applicant's arguments filed 5/1 1/06 have been fully considered but they are not 
persuasive. 

Regarding claims 1-13, 15, and 17-20, the examiner respectfully disagrees. The 
Mel scale is a perceptual scale of pitches, and therefore the use of MFCCs reads on the 
limitation of "a critical band filtering process that filters the audio data proximate a 
human auditory frequency range". Mauuary et al., USPN 6,157,909 is used as 
evidence to this statement. Mauuary et al. teaches the use of non-linear perceptual 
scales, which are either the Mel scale or the Bark scale. Mauuary et al. further teaches 
the MFCC calculation process using the Mel scale (Col. 2, lines 9-29 and Fig. 1b). 

2. Applicant's arguments with respect to claims 21-35 have been considered but are 
moot in view of the new ground(s) of rejection. See the new grounds of rejection under 
35 USC 101 and 35 USC 103. 

Claim Rejections - 35 USC §101 

3. 35 U.S.C. 101 reads as follows: 

Whoever invents or discovers any new and useful process, machine, manufacture, or composition of 
matter, or any new and useful improvement thereof, may obtain a patent therefor, subject to the 
conditions and requirements of this title. 

4. Claims 1-13, 15, 17, 19-26, and 28-35 are rejected under 35 U.S.C. 101 
because the claimed invention is directed to non-statutory subject matter. 

5. Claim 1 is seemingly a patentable process, however, it is in reality seeking 
patent protection of the computer program as evidenced by claim 18. It is non-statutory 
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because it lacks a practical application by physical transformation. There is no positive 
recitation of an output or outcome that provides a practical application (i.e. providing a 
use for the spectral feature vector). Furthermore, it is non-statutory because a practical 
application that produces a useful, concrete, and tangible result has not been claimed. 
Claim 18 will need to be amended in a way such that the practical application, which 
claim 1 needs, is not performed by software. 

6. Claims 2-13, 15, 17, and 19-20 are non-statutory because they depend on claim 
1. 

7. Specifically regarding claim 19, the claimed subject matter is "a modulated data 
signal carrying computer executable instructions for performing the method*'. Signals 
are not covered by the statutory categories and is not protected by judicial exemptions. 
Therefore, a modulated signal is non-statutory matter. 

8. Claim 21 is non-statutory for similar reasons as claim 1 as evidenced by claim 
27. Claim 27 will need to be amended in a way such that the practical application, 
which claim 21 needs, is not performed by software. 

9. Claims 22-26, 28, and 29 are non-statutory because they depend on claim 21 . 

10. Specifically regarding claim 28, the claimed subject matter is "a modulated data 
signal carrying computer executable instructions for performing the method". Signals 
are not covered by the statutory categories and is not protected by judicial exemptions. 
Therefore, a modulated signal is non-statutory matter. 

1 1 . Claim 30 is seemingly a patentable apparatus, however, it is in reality seeking 
patent protection of the computer program as evidenced by the specification on page 
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23, lines 24-31 . It is non-statutory because it lacks a practical application by physical 
transformation. There is no positive recitation of an output or outcome that provides a 
practical application (i.e. the result of the processing means is unclear). Furthermore, it 
is non-statutory because a practical application that produces a useful, concrete, and 
tangible result has not been claimed. 

12. Claims 31 and 32 are non-statutory because they depend on claim 30. 

13. Regarding claim 33, the claimed subject matter is "a classification chain data 
structure... comprising". Data is not covered by the statutory categories and is not 
protected by judicial exemptions. Therefore, a data structure is non-statutory matter. 

14. Claim 34 is seemingly a patentable process, however, it is in reality seeking 
patent protection of the computer program as evidenced by the specification on pages 
36-38, which discloses the classification chain as software for controlling a digital signal 
processor. It is non-statutory because it lacks a practical application by physical 
transformation. There is no positive recitation of an output or outcome that provides a 
practical application (i.e. the result of the classification chain processing is unclear). 
Furthermore, it is non-statutory because a practical application that produces a useful, 
concrete, and tangible result has not been claimed. 

15. Claim 35 is non-statutory because it depends on claim 34. 

Claim Rejections - 35 USC § 102 

16. The text of those sections of Title 35, U.S. Code not included in this action can 
be found in a prior Office action. 
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17. Claims 30-33 are rejected under 35 U.S.C. 102(e) as being clearly anticipated by 
Gjerdingen. 

18. Regarding claim 30, see Gjerdingen 

A computing system, comprising: 

a computing device including: 

a classification chain data structure stored thereon having a plurality of classification 
vectors, wherein each vector includes data representative of a spectral properties class as 
classified by humans and spectral properties characteristics as determined by digital signal 
processing; and (Col. 3, lines 50^54 and Col. 9, lines 28-39) 

processing means for comparing an unclassified media entity to the classification chain 
data structure to determine an estimate of the spectral properties class of the unclassified media 
entity. (Col. 6, line 38 - Col. 7, line 2) 

Gjerdingen teaches a computing system with these features to create a searchable 

database. Gjerdingen teaches either human or machine classification (Fig. 4, items, 

403 and 403B and col. 6, lines 38-64). 

19. Regarding claim 31 , the further limitation of claim 30, see Gjerdingen 

. . . wherein said determining of an estimate of the spectral properties class includes returning at least one 
number indicating the level of confidence of the spectral properties class assignment (Col. 10, lines 
53-57) 

Gjerdingen teaches a level of confidence indicator. 

20. Regarding claim 32, the further limitation of claim 31 , see the preceding 
argument with respect to claims 30 and 31 . It is inherent that a system using the 
method taught by Gjerdingen will undergo an improvement in classification with experts 
review and more data samples (Col. 8, lines 19-24). 

21 . Regarding claim 33, see the preceding argument with respect to claim 30. 
Gjerdingen teaches a system with human and DSP classification techniques regarding 
the spectral properties of media entities. 
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Claim Rejections - 35 USC § 103 

22. The text of those sections of Title 35, U.S. Code not included in this action can 
be found in a prior Office action. 

23. Claims 1-13, 15, and 18-20 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over the combination of Blum and Kjaer, U.S. Pat. No. 4,603,386. 

24. Regarding claim 1, see Blum 

A method for automatically classifying spectral properties of audio data, comprising: 

applying input audio data (1)toa critical band filtering process to form first output data and (2) to 

an entropy calculation process to form second output data; (Col. 6, lines 24-28) 

applying the first output data to a first derivative process to form third output data; (Col. 6, lines 

28-30) and 

inputting said first, second and third output data to an averaging process to form a spectral 
feature vector representing the input audio data. (Col. 6, lines 32-35 and lines 45-48) 

Blum teaches a method for automatically classifying spectral properties of audio data, 

wherein a feature vector is created with the above features. The critical band filtering 

process, as taught by Blum, is a Mel-frequency cepstral coefficient process. Blum does 

not teach the entropy calculation for use in a feature vector, however Blum has 

described a feature vector with a plurality of metrics. Kjaer teaches an entropy 

calculation, wherein a musical tone is classified by notes and accidentals (see Abstract 

and Col. 4, line 55 - Col. 7, line 34). Kjaer teaches that entropy is useful in classifying 

information composed of random processes, or processes that can be better 

understood using probability theory. It would have been obvious for one of ordinary skill 

in the art at the time of the invention to combine the teachings of Blum and Kjaer for the 

purpose of better classification. 

25. Regarding claim 2, the further limitation of claim 1 , see Blum 
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. . . wherein the audio data is divided into frames, and the method is performed frame by frame. (Col. 6, 
lines 56-58) 

In the combination, Blum teaches the division of audio data into frames, wherein the 
method is performed frame by frame. 

26. Regarding claim 3, the further limitation of claim 1 , see 

... further including calculating root mean squared values of the input audio data. (Col. 8, lines 1-3) 
In the combination, Blum teaches RMS values. 

27. Regarding claim 4, the further limitation of claim 2, see Kjaer 
... wherein said entropy calculation process includes calculating: 

s - - Zw Pw log2(p w ) 

where S is the entropy of the frame, p w is the normalized magnitude of a bin w of the audio data, and 
log 2 (p w ) is the log base 2 of(p w ). (Col. 5, lines 5-12 and equation H(x)) 

Kjaer teaches this entropy measure. 

28. Regarding claim 5, the further limitation of claim 2, see the preceding argument 
with respect to claim 3. Blum teaches the square root of the sum of squares, where the 
square root is a mapping function and adjusts the scale of the function. 

29. Regarding claim 6, the further limitation of claim 2, see the preceding argument 
with respect to claim 1 . The combination teaches this feature. 

30. Regarding claim 7, the further limitation of claim 1 , see the preceding argument 
with respect to claim 1 . The combination teaches a frequency domain transform. 

31 . Regarding claim 8, the further limitation of claim 7, see Blum 

... wherein said converting of the input audio data signal from the time domain to the frequency domain 
includes performing a fast fourier transform on the audio data. (Col. 7, lines 56-61 ) 

In the combination, Blum teaches an FFT. 
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32. Regarding claim 9, the further limitation of claim 2, see the preceding argument 
with respect to claim 1 . The combination teaches dividing the input signal into frames 
and averaging the features over all the frames. 

33. Regarding claim 10, the further limitation of claim 1 , see the preceding argument 
with respect to claim 1 . The combination teaches a classification process using the 
feature vector, and this classification process determines a property class that describes 
the audio data (Col. 6, lines 7-1 0). 

34. Regarding claim 11, the further limitation of claim 1 , see the preceding argument 
with respect to claim 1 . In the combination, Blum teaches a feature vector, and Blum 
teaches that a vector is a row vector and not an NxM array (Col. 5, lines 52-55). Blum 
teaches a 1xN array, wherein it is inherent that N can be 25. 

35. Regarding claim 12, the further limitation of claim 1 , see Blum 

... wherein the audio data is formatted according to pulse code modulated format (Col. 5, lines 24-50 
and lines 64-66) 

In the combination, Blum teaches a plurality of input devices in the system, wherein it is 
well known that optical disks containing audio data are encoded. in a PCM format. 
Inherently Blum teaches this feature. 

36. Regarding claim 13, the further limitation of claim 12, see the preceding 
argument with respect to claim 12. In the combination, Blum teaches the use of a 
microphone and further teaches that a sound produced into the microphone can be 
searched (Col. 3, lines 52-55). It is inherent that the digitization step converts the 
analog waveform to a PCM format. 
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37. Regarding claim 15, the further limitation of claim 12, see the preceding 
argument with respect to claim 8. The combination teaches an FFT operation, which is 
performed on the audio data. 

38. Regarding claim 18, the further limitation of claim 1 , see Blum 

A computer readable medium bearing computer executable instructions for carrying out the method of 
claim 1. (Col. 5, lines 42-49) 

Blum teaches computer readable mediums with instructions for carrying out the method 
taught by the combination. 

39. Regarding claim 19, the further limitation of claim 1 , see the preceding argument 
with respect to claim 12. Blum teaches a communication adapter, where it is inherent 
that this adapter is a modem, which uses keying or modulation to transmit and receive 
data. It is also well known in the art that computer executable instructions can be 
transmitted in a network. It would have been obvious for one of ordinary skill in the art 
at the time of the invention to combine the teachings of Blum and the well-known 
technique of transmitting computer executable instructions over a network for the 
purpose of storing data a centralized server. A centralized server has many 
advantages, wherein one advantage is that one copy of a program can be sent to a 
plurality of remote computers. 

40. Regarding claim 20, the further limitation of claim 1 , see the preceding argument 
with respect to claim 18. The combination teaches a computing device for performing 
the method. 
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41. Claim 17 is rejected under 35 U.S.C. 103(a) as being unpatentable over the 
combination of Blum and Kjaer as applied to claim 1 above, and further in view of 
Gjerdingen. 

42. Regarding claim 17, the further limitation of claim 1 , see Gjerdingen 

. . . further comprising performing a principal component analysis process on the spectral feature vector 
(Col. 15, lines 37-44) 

Blum teaches a refining process on the feature vector, but does not teach principal 
component analysis (PCA). Gjerdingen teaches that PCA is used to reduce the 
complexity of the data being analyzed. It would have been obvious for one of ordinary 
skill in the art at the time of the invention to combine the teachings of Blum, Kjaer, and 
Gjerdingen for the purpose of reducing the complexity of the analysis. 



43. Claims 21-29, 34, and 35 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over the combination of Blum and Gjerdingen. 

44. Regarding claim 21, see Blum 

A method of classifying data according to spectral properties of the data, comprising: 

assigning at least one spectral properties class to each media entity of a plurality of media entities 
in a data set wherein said assigning is not based on digital signal processing; (Col. 21 , lines 55-58, 

line 64 - Col. 22, line 3 and Col. 22, lines 31-33) 

processing each media entity of said data set to extract at least one spectral properties 
characteristic based on digital signal processing of each media entity; (Col. 22, lines 45-48) 

generating a plurality of spectral properties vectors for said plurality of media entities, wherein 
each spectral properties vector includes said at least one spectral properties class and at least one 
spectral properties characteristic based on digital signal processing; and (Col. 22, lines 48-50) 

forming a classification chain based upon said plurality of spectral properties vectors and the at 
least one spectral properties class . (Col.- 22, lines 55-65) 

Blum teaches a method equivalent of classifying data according to its spectral 



properties and class with these features. 



However, Blum teaches a disjointed 
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approach, wherein the spectral properties class assigning that is not based on digital 
signal processing and the extraction of the spectral properties characteristic based on 
digital signal processing are not taught to be performed together in classifying (i.e. Blum 
teaches the use of DSP when the non-DSP classification method fails and does not 
positively say they are used together to classify signals). 

Gjerdingen teaches the use of DSP and non-DSP classification methods together 
to model, or classify, the signals (Col. 6, lines 38-64 and Fig. 4, units 401, 403A, 403B, 
and 404-406). It would have been obvious for one of ordinary skill in the art at the time 
of the invention to combine the teachings of Blum and Gjerdingen for the purpose of 
placing music under many searchable elements (i.e. searching by artist, mood, genre, 
sub-genre, etc..) (Gjerdingen, Col. 3, lines 23-67, Col. 8, lines 34-40, and lines 50-57). 

45. . Regarding claim 22, the further limitation of claim 21 , see 
. . . further comprising: 

processing an unclassified media entity to extract at least one spectral properties characteristic 
based on digital signal processing of the unclassified media entity; (Col. 21 , lines 55-58) 

generating a vector for the unclassified media entity including said at least one digital signal 
processing spectral properties characteristic; (Col. 21 , lines 58-60) 

presenting the vector for the unclassified media entity to the classification chain; and 

classifying the unclassified entry with an estimate of the spectral properties class by calculating 
the representative spectral properties class of the subset of the plurality of vectors of the classification 
chain located in the neighborhood of the vector for the unclassified entity. (Col. 21 , line 66 — Col. 22, 
line 3) 

Blum teaches these features in a method of classifying data. 

46. Regarding claim 23, the further limitation of claim 22, see Blum 

... further including calculating a neighborhood distance that defines a distance within which two vectors 
in the classification chain space are in the same neighborhood for purposes of being in the same spectral 
properties class. (Col. 22, lines 3-20) 

Blum teaches a calculation of a neighborhood distance. 
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47. Regarding claim 24, the further limitation of claim 22, see the preceding 
argument with respect to claim 23. Blum teaches classifying the entries according to 
statistical properties of the spectral properties of an entry, such as standard deviations 
or range values (Col. 21 , lines 61-63). It is inherent to use the median to describe 
skewed sample ranges (Col. 22, lines 21-26). 

48. Regarding claim 25, the further limitation of claim 22, see the preceding 
argument with respect to claim 23. Blum teaches a method of describing an 
unclassified entry according to a numerical value with these features. 

49. Regarding claim 26, the further limitation of claim 22,. see the preceding 
argument with respect to claim 31 . Blum teaches the features of the parent claims 21 
and 22, but Blum does not teach a level of confidence measure. Gjerdingen teaches a 
measure indicating the level of confidence regarding classification. 

50. Regarding claim 27, the further limitation of claim 21 , see Blum 

A computer readable medium bearing computer executable instructions for carrying out the method of 
claim 21. (Col. 5, lines 41-49) 

Blum teaches a computer readable medium bearing instructions for a method with the 
features of claim 21 . 

51 . Regarding claim 28, the further limitation of claim 21 , see Blum 

A modulated data signal carrying computer executable instructions for performing the method of claim 21. 
(Col. 5, lines 22-40). 

Blum teaches a communication adapter, wherein it is inherent that modulated signals 
are transferred using communication adapters. It is well known that computer 
instructions can be transferred over a network using communication adapters. It would 
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have been obvious for one of ordinary skill in the art at the time of the invention to 
combine the teachings of Blum and the well-known technique of transmitting computer 
executable instructions over a network for the purpose of storing data a centralized 
server. A centralized server has many advantages, wherein one advantage is that one 
copy of a program can be sent to a plurality of remote computers. 

52. Regarding claim 29, the further limitation of claim 21 , see the preceding 
argument with respect to claim 27. Blum teaches at least one computing device for 
performing the method. 

53. Regarding claim 34, Gjerdingen teaches human classification of the audio data 
(Fig 5a), wherein each entry is assigned at least one value. Blum teaches that the 
construction phase reduces the results to a set of numbers called the characteristic 
vector, wherein the vector is stored for later use (Col. 3, lines 10-21 and lines 30-34). 
Blum also teaches a classification phase comprising presenting an unclassified sound 
and calculating the characteristic vector (Col. 6, lines 24-30), and presenting the vector 
to the classification chain, which returns an estimate (Col. 6, lines 45-52). 

54. Regarding claim 35, the further limitation of claim 34, see the preceding 
argument with respect to claims 1-5 and 34. The combination of Blum and Gjerdingen 
teach these features in a method of classifying data. 

Conclusion 

55. The prior art made of record and not relied upon is considered pertinent to 
applicant's disclosure: 
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Glaser et al., USPN 7,003,515 - teaches classification using vectors (see Brief 
Summary, col. 1-2); 

Logan et al., USPN 7,031,980 - teaches different spectral representations of the 
input signal and MFCCs (see Detailed Description, col. 5-6); and 

Forbes.com "MongoMusic Fans Include Microsoft" - teaches expert classification 
and DSP classification 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Daniel R. Sellers whose telephone number is 571-272- 
7528. The examiner can normally be reached on Monday to Friday, 9am to 5:30pm. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Sinh Tran can be reached on (571)272-7564. The fax phone number for the 
organization where this application or proceeding is assigned is 703-872-9306. 

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). 
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